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(54) VOICE INPUT DEVICE 

(57)Abstract: 

PURPOSE: To attain the automatic switching between 
the processing contents to be given to the voice 
information thai is inputted next and an output mode 
based on the using state and/or the using history of an 
input device which inputs the information other than 
those on voices. 

CONSTITUTION: The signal showing a use/non-use 
state or an under-use state is fetched from each of input 
devices 2a, 2b. ..2n such as a keyboard, a mouse, etc., 
which input the information other than those on voices. 
Then the fetched signal is sent to a deciding part 7. The 
part 7 decides the contents of the voice information 
supplied from a voice input part 1 based on the using 
states and/or the using histories of the devices 2a, 2b... 

2n. Based on the contents of this decision, an instruction is given to a voice processing part 5 
to perform a prescribed processing and also an instruction is given to an output part 6 to 
perform an output operation in a prescribed mode. 
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CLAIMS 

(57) [Claim(s)] 
[Claim 1] 

In the audio input unit equipped with the voice input section, the input 
unit which inputs information other than voice, and the speech 
processing section which performs predetermined processing to the 
speech information inputted from said voice input section, 
The audio input unit which judges the content of the processing which 
should perform whether said input unit is using it based on the signal of 
said input unit to the inputted speech information according to the input 
unit which detected and was detected [ that it is under activity, and ], 
and is characterized by providing the judgment section which controls 
said speech processing section according to the judged result. 
[Claim 2] 

Said judgment section has the configuration file to which the content of 
the speech information inputted into the input unit detected [ that it is 
under activity and ] was set, 

The audio input unit according to claim 1 which judges the content of 
the processing which should be performed to the inputted speech 
information based on the input unit detected [ that it is under activity, 
and ], and said configuration file, and is characterized by controlling said 
speech processing section according to the judged result. 
[Claim 3] 

Said configuration file matches speech information with application a 
default or in use, and is set up, 

Said judgment section detects the signal of application in use, and it 
judges whether the existence of application in use and application are 
using it, 
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The audio input unit according to claim 2 which judges the content of 
the processing which should be performed to the inputted speech 
information based on the input unit detected [ that it is under activity, 
and ], the application judged that are under activity, and said 
configuration file, and is characterized by controlling said speech 
processing section according to the judged result. 
[Claim 4] 

Said speech processing section is equipped with two or more voice 
dictionaries and the dictionary change section which chooses any one or 
the plurality of two or more of these voice dictionaries. 
It is an audio input unit given in claim 1 which said judgment section 
outputs the selection command of a voice dictionary to said dictionary 
change section according to the judged result, and is characterized by 
said dictionary change section changing a voice dictionary based on said 
selection command thru/or any 1 term of 3. 
[Claim 5] 

In the audio input unit equipped with the voice input section, the input 
unit which inputs information other than voice, the speech processing 
section which performs predetermined processing to the speech 
information inputted from said voice input section, and the output 
section which outputs the result processed in this speech processing 
section, 

The audio input unit characterized by to provide the judgment section 
which judges the output mode of the content of processing which should 
perform whether said input unit is using it based on the signal of said 
input unit to the inputted speech information according to the input unit 
which detected and was detected [ that it is under activity, and ], and 
this processing result, and controls said speech-processing section and 
said output section according to this judgment result. 
[Claim 6] 

It is the audio input unit according to claim 5 characterized by making it 
have operated in order to output the mode which defined said output 
section beforehand regardless of the judgment result of the judgment 
section when the recognition result of the speech processing section to 
the inputted speech information was the word defined beforehand. 
[Claim 7] 

Said input device is an audio input unit given in claim 1 characterized by 
being a keyboard and/or a mouse thru/or any 1 term of 6. 



[Translation done.] 
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* * NOTICES * 

JPO and INPIT are not responsible for any 
damages caused by the use of this translation. 

1 This document has been translated by computer. So the translation 

may not reflect the original precisely. 

2.**** shows the word which can not be translated. 

3.1n the drawings, any words are not translated. 



DETAILED DESCRIPTION 

[Detailed Description of the Invention] 
[0001] 

[Industrial Application] 

This invention relates to the audio input unit which enabled the change 
of modification of the content of an output of the speech information 
into which the content of processing which should be given to the 
inputted speech information was changed and inputted automatically, 
without using a switch etc. 
[0002] 

[Description of the Prior Art] 

Drawing 7 is the block diagram showing the configuration of the audio 
input unit in the former, and one in drawing is the voice input sections, 
such as a microphone, 2a, and 2b. — 2n of input units which input other 
information other than voice, such as a keyboard and a mouse, is shown. 

The speech information inputted from the voice input section 1 is 
inputted into the speech recognition section 5. 
The speech recognition section 5 is set as the processing mode 
according to each, such as the speech information beforehand inputted 
with a switch 21, for example, text information, and command 
information. When a processing mode is text information processing 
Mohd, the dictionary for text information processing is read from the 
dictionary storing section 22. Based on this, in being command 
information processing Mohd, it reads the dictionary for command 
information processing from the dictionary storing section 22, and 
recognition processing of the speech information inputted based on this 
is performed, and a recognition result is outputted to the processing 
result output section 6. 

The processing result output section 6 is also set as the output mode 
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corresponding to the speech information beforehand inputted with a 
switch 21, and outputs the inputted recognition result as a command as 
a text with the input from other input units 2a-2n, respectively. 
[0003] 

[Problem(s) to be Solved by the Invention] 

By the way, therefore, the object inputted through the voice input 
section 1 changes each time, when it is text, such as a text, and when it 
is the operating instructions to application, a window manager, and OS, 
or when it is a voice data point. 
[0004] 

Since the content of the processing which should be performed in the 
speech recognition section 5 to such various objects for an input, and 
the procedure of processing also differed from self-**, if the speech 
recognition section 5 needed to be switched to the processing mode 
which was adapted for each object for an input and it was in the former, 
the switch 21 was switched by hand control or voice input, and the 
processing mode was set up. This point is the same also in the 
processing result output section 6. 
[0005] 

A user a switch 21 to switch with hand control However, a keyboard in 
use, or — from a mouse etc. — once — a hand — not detaching, in 
order not to obtain, to interrupt actuation of a keyboard and a mouse 
and to switch by voice input When it was naturally necessary to register 
the special command for a change, the incorrect recognition resulting 
from the conversation in the circumferences other than a noise and 
other input voice etc. arose, and when the operator did not expect, 
there was inconvenience — the change of a processing mode and 
output mode may be performed suddenly. 
[0006] 

This invention is made in view of this situation, and the place made into 
the object is to enable it to make automatically modification of the 
processing which should be performed to input voice, and a change of an 
output mode, without requiring actuation special to an operator. 
Other objects of this invention are to enable it to switch a voice 
dictionary automatically according to the speech information which 
performed speech recognition processing to the speech information 
inputted in the speech processing section, and was inputted 
[ command / the text, ] into the case. 
[0007] 

The object of further others of this invention is to judge this 



file://C:¥Documents%20and%20Settings¥saeko¥My%20Documents¥JPOEn¥JP-B-... 2007/02/15 



Ul D-JJOU / / t 



w / f— v> 



automatically in the judgment section and enable control to the output 
section, also when outputting to the inputted speech information, 
without performing speech processing. 

Although the usual computers, such as a keyboard and a mouse, are 
equipped with the object of further others of this invention, it is to 
enable far-reaching application by supposing that the judgment section 
judges based on an operating condition and/or activity hysteresis. 
[0008] 

When the inputted speech information is the word defined beforehand, 
regardless of the judgment result of the judgment section, the object of 
further others of this invention is making the mode defined beforehand 
output from the output section, and is shown in attaining the increase in 
efficiency of processing. 

The object of further others of this invention is to enable modification of 
the content of the judgment processing in the judgment section if 
needed, and is to close applicability, if amplification is still more possible. 

[0009] 

[Means for Solving the Problem] 

It explains based on principle drawing showing the principle of this 
invention in drawing 1 below. 

Drawing 1 is principle drawing of the audio input unit concerning this 
invention, and one in drawing shows input units which input information 
other than voice, such as a keyboard and a mouse, the voice input 
section, 2a, and 2b~2n. 

The speech information inputted from the voice input section 1 is 
inputted into the speech processing section 5 as digital information. 
[0010] 

On the other hand, input units [ 2a-2n ] an operating condition and/or 
activity hysteresis are crowded judgment section 7 HE picking serially. 
The content of the judgment processing to which the judgment section 7 
was beforehand set based on these operating conditions and/or activity 
hysteresis, The speech information inputted into the speech information 
currently actually inputted from the voice input section 1 according to 
an algorithm or a degree namely, in the thing of what kind of content for 
example, text input or command input — or data without the need of 
processing in any way in the speech processing section etc. are judged, 
and a predetermined command is given to the speech processing section 
5 and the output section 6 based on this judgment result, respectively. 
[0011] 
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" Generally, in the case of the operator who became skillful in the input of 
a keyboard, usually, the direction inputted using a keyboard can process 
promptly rather than inputting a text etc. by voice input. Therefore, 
when using the keyboard, objects it is considered that want to dare 
carry out voice input in the skilled operator are operating instructions, 
such as opening of a window, and a mode change of application, in many 
cases. On the other hand, when a mouse is under activity, to inputting a 
text etc., a hand is once lifted from a mouse, and a text is inputted using 
a keyboard, and since the actuation which returns a hand to a mouse is 
needed again, if you want to input text not only by operating instructions 
but by voice input, it will want in many cases. 

That is, it becomes possible to judge the output mode of the content of 
processing, and the output section 6 which should be given in the 
speech processing section 5 to input voice by grasping the keyboard 
which is the input device which inputs information other than voice, the 
activity hysteresis of a mouse, and an operating condition. 
[0012] 

To the speech processing section 5, to the speech information inputted, 
when speech recognition processing should be performed [ whether 
speech recognition processing should be performed and ] again, as a 
text, the command of whether to perform recognition processing as a 
command is given, and the speech processing section 5 is controlled. 
Moreover, that it should output as a command that it should output as a 
text to the output section 6 when the recognition result given from the 
speech processing section 5 is a text when it is a command, about the 
content from which it was not further processed at all in the speech 
processing section 5, a command is given that this should be outputted 
to a wave editor, for example, and the output section 6 is controlled. 
[0013] 

The speech processing section 5 outputs this to the output section 6 by 
this, without performing predetermined speech processing to the speech 
information which mode setting of the response of was made possible to 
the speech information inputted according to the command from the 
judgment section 7, and was inputted, or processing. 
Moreover, similarly the output section 6 follows a command from the 
judgment section 7, and, in the case of text input, the input from the 
speech processing section 5 is input-device 2a of the others [ case / of 
the command input as a text ] as command input, and 2b. — It outputs 
to a word processor, a wave editor, etc. like the input from 2n. 
[0014] 
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In addition, when the recognition result of the inputted speech 
information is the specific word defined beforehand, the output section 
is good also as giving priority over other modes to the output of the 
mode defined beforehand, and performing it. 

Moreover, an operator's content of judgment processing of the judgment 
section 7 is possible for modification to arbitration, and an operator 
enables it to have expanded the applicable range by making a judgment 
result change according to the object for application. 
[0015] 

In the audio input unit with which the audio input unit concerning the 1st 
invention was equipped with the speech processing section which 
performs predetermined processing to the voice input section, the input 
unit into which information other than voice is inputted, and the speech 
information into which it was inputted from said voice input section The 
content of the processing which should perform whether said input unit 
is using it based on the signal of said input unit to the inputted speech 
information according to the input unit which detected and was detected 
[ that it is under activity and ] is judged, and it is characterized by 
providing the judgment section which controls said speech processing 
section according to the judged result. 
[0016] 

The audio input unit concerning the 2nd invention is characterized by to 
have the configuration file to which the content of the speech 
information inputted into the input unit detected [ that said judgment 
section is under activity in the 1st invention and ] was set, to judge the 
content of the processing which should perform to the inputted speech 
information based on the input unit detected [ that it is under activity, 
and ], and said configuration file, and to control said speech-processing 
section according to the result which judged. 
[0017] 

The audio input unit concerning the 3rd invention is set to the 2nd 
invention. Said configuration file Speech information is matched with 
application a default or in use, and it has set up. Said judgment section 
Detect the signal of application in use and it judges whether the 
existence of application in use and application are using it. Based on the 
input unit detected [ that it is under activity and ], the application judged 
that are under activity, and said configuration file, the content of the 
processing which should be performed to the inputted speech 
information is judged, and it is characterized by controlling said speech 
processing section according to the judged result. 
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'[0018] 

The audio input unit which the 4th invention requires is set to the 1st 
thru/or the 3rd invention. The voice dictionary of plurality [ section / 
said / speech processing ], It has the dictionary change section which 
chooses any one or the plurality of two or more of these voice 
dictionaries, said judgment section outputs the selection command of a 
voice dictionary to said dictionary change section according to the 
judged result, and said dictionary change section is characterized by 
changing a voice dictionary based on said selection command. 
[0019] 

The input unit into which the audio input unit concerning the 5th 
invention inputs information other than voice as the voice input section, 
In the audio input unit equipped with the speech processing section 
which performs predetermined processing to the speech information 
inputted from said voice input section, and the output section which 
outputs the result processed in this speech processing section It 
detects whether based on the signal of said input unit, said input unit is 
using it. According to the input unit detected [ that it is under activity 
and ], the output mode of the content of processing which should be 
given to the inputted speech information, and this processing result is 
judged, and it is characterized by providing the judgment section which 
controls said speech processing section and said output section 
according to this judgment result. 
[0020] 

When the audio input unit concerning the 6th invention is the word 
which the recognition result of the speech processing section to the 
inputted speech information defined beforehand in the 5th invention, said 
output section is characterized by making it have operated in order to 
output the mode defined beforehand regardless of the judgment result of 
the judgment section. 
[0021] 

The audio input unit concerning the 7th invention is characterized by 

said input devices being a keyboard and/or a mouse in the 1st thru/or 

the 6th invention. 

[0022] 

[Function] 

If it is in the 1st invention, the processing which the judgment section 
should perform to input voice according to whether the input unit which 
inputs information other than voice is using it will be judged 
automatically, it will direct to the speech processing section, and an 
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operator does not need directions of the content of processing. 
[0023] 

If it is in the 2nd invention, the processing which the judgment section 
should perform to input voice based on the content of whether the input 
unit which inputs information other than voice is using it, and the 
configuration file will be judged automatically, it will direct to the speech 
processing section, and an operator does not need directions of the 
content of processing. 
[0024] 

If it is in the 3rd invention, the processing which the judgment section 
should perform to input voice based on the content of whether the input 
unit which inputs information other than voice is using it, and the 
configuration file will be judged automatically, it will direct to the speech 
processing section, and an operator does not need directions of the 
content of processing. 
[0025] 

If it is in the 4th invention, a change becomes possible automatically in 
the dictionary change section about two or more dictionaries used in the 
speech processing section for speech recognition. 
[0026] 

If it is in the 5th invention, it becomes possible to perform control to 
these automatically with the judgment section outputting a command to 
the speech processing section and each output section according to a 
judgment result based on whether the input unit which inputs 
information other than voice is using it. 
[0027] 

If it is in the 6th invention, it is making the mode beforehand set to the 
output section output to the recognition result beforehand defined 
regardless of the judgment result of the judgment section, and 
operability may be improved while reducing malfunction. 
[0028] 

If it is in the 7th invention, it is using the information about whether a 

keyboard and a mouse using it, and becomes applicable to a general 

purpose computer equipped with a keyboard and a mouse. 

[0029] 

[Example] 

(Example 1) 

This invention is concretely explained based on the drawing in which the 
example is shown below. 

Drawing 2 is the block diagram showing the configuration at the time of 
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" applying the audio input unit concerning this invention to the word 
processor 1 1 with a graphic form edit function, and the voice input 
section by which one in drawing was constituted from a microphone etc., 
2a, and 2b show input units which input information other than voice, 
such as a keyboard and a mouse. 

The speech information inputted from the voice input section 1 is 
inputted into the speech processing section 5 which the analog signal 
was changed by the digital signal and consisted of the A/D-conversion 
sections 3 as the speech recognition section. 
[0030] 

Input unit 2a which, on the other hand, inputs information other than 
voice, and the input from 2b are inputted into WORD PUSESSA 1 1, and 
also they are incorporated serially to the judgment section 7. 
The judgment section 7 judges the content of the processing which 
should be performed to the content of the speech information which 
recognizes these operating conditions and/or activity hysteresis 
according to input unit 2a, such as a keyboard and a mouse, and the 
content of judgment processing beforehand set up based on the signal 
from 2b, i.e., an algorithm, and is actually inputted through the voice 
input section 1 , and is inputted behind, and the speech information 
inputted. 

The output mode from the content and the output section 6 of the 
processing which the specifically inputted speech information should 
perform to text information, command information, and a list in the 
speech processing section 5 to each information is judged, and the 
command according to each is given to the dictionary change section 8 
and the output section 6. 
[0031] 

In addition, when the sensor which detects the existence of that activity 
is attached to each input unit 2a, such as a keyboard and a mouse, and 
2b, it is good also as incorporating this sensor output in the judgment 
section 7, and judging by recognizing these operating conditions and 
activity hysteresis. 

The dictionary change section 8 reads the dictionary for texts, or the 
dictionary 10 for commands to the speech processing section 5 by the 
command from the judgment section 7. 
[0032] 

The speech processing section 5 reads the dictionary 9 for texts or the 
dictionary 10 for /commands selectively changed in the dictionary 
change section 8 which operates based on the command from said 
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' judgment section 7, performs recognition processing of speech 
information based on these, and outputs a recognition result to the 
output section 6. Based on the directions from said judgment section 7, 
in the case of the command input as a text, the output section 6 
outputs this to a word processor 1 1 as a command, when speech 
information is text input. 
[0033] 

Next, it explains according to the flow chart which shows actuation of 
this invention equipment to drawin g 3 . 

Drawing 3 is a flow chart which shows a processing process in case the 
judgment section 7 judges based on what an input device present in use 
is. first — input unit 2a in use and 2b — or [ any ] — judging — (step 
S1) text input [ as opposed to / in the speech information inputted when 
an input unit in use is a mouse / the document under edit with a word 
processor 1 1 ] — judging — (step S2) Moreover, in being a keyboard, 
the speech information inputted judges with the command input to a 
word processor 1 1 . (step S3) The directions which correspond to the 
dictionary change section 8 and the output section 6 based on each 
judgment are outputted. 
[0034] 

Next, an example is given and the content of processing is explained. 
For example, if it utters with "save" in order to take backup of a text 
when an operator is creating a text with a word processor 1 1 using a 
keyboard Recognize that an operator is keyboard using the judgment 
section 7, and the inputted speech information judges it like the above- 
mentioned to be the command input to a word processor 1 1 . A command 
is outputted that the dictionary 10 for commands should be chosen to 
the dictionary change section 8, and it directs by making the recognition 
result of the speech recognition section into a command to the output 
section 6 that it should output to a word processor 1 1 . 
[0035] 

Consequently, the speech information inputted in the speech processing 
section 5 constituted as the speech recognition section is recognized to 
be "save" using the dictionary 10 for commands, and that recognition 
result is outputted to the output section 6. The output section 6 makes 
a recognition result "save" a command "save", it outputs to a word 
processor 1 1 , and a word processor 1 1 saves the document under edit 
in response to a command "save." 
[0036] 

Moreover, a location to put "Japanese" first is specified with a mouse in 
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' order to write the text "Japanese" in the predetermined part in the 
graphic form as that to which the operator is performing graphic form 
edit with the word processor 1 1 , and suppose that it uttered with 
"Japanese." The judgment section 7 judges the voice inputted as the 
operator recognizes and mentioned that a mouse was using it above to 
be the text input to the document under edit of a word processor 1 1 , 
and directs it that the dictionary 9 for texts should be chosen as the 
dictionary change section 8, and is directed that it should output as a 
text display to the output section 6. 
[0037] 

By this, the speech processing section 5 recognizes the inputted speech 
information to be "Japanese" using the dictionary 9 for texts, and 
outputs this recognition result to the output section 6. The output 
section 6 uses "Japanese" as a text, it outputs to a word processor 1 1 
and a word processor 1 1 indicates the "Japanese" which is a text by 
insertion in the directions location by the mouse. 
[0038] 
(Example 2) 

The example 2 is performing edit to the speech information inputted 
using the wave editor 1 2, and shows the case where there is no need 
that the speech processing section 5 constituted as the speech 
recognition section to the inputted speech information performs special 
processing. 

Dra win g 4 is the block diagram showing the configuration of the example 
2 of this invention. In this example 2, the phonetic memory section 4 is 
infixed in the medium of the A/D-conversion section 3 and the speech 
processing section 5 constituted as the speech recognition section, and 
it is made to have outputted the directions from the judgment section 7 
also to this phonetic memory section 4 besides the dictionary change 
section 8 and the output section 6. Moreover, it is made to be given to 
the judgment section 7 in the signal which shows that the output from 
the output section 6 other than the output from input-device 2a, such 
as a keyboard and a mouse, and each 2b is inputted, and the wave editor 
12 is [ wave editor ] under activity from the wave editor 12. 
[0039] 

The judgment section 7 outputs a command that it should recognize that 
it is data for wave edit of the content of the inputted speech information 
based on input-device 2a, such as a keyboard and a mouse, the signal 
which shows the activity of 2b, and the signal which shows the activity 
of the wave editor from the wave editor 1 2, and a command should 
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' output that voice should be memorized to the phonetic-memory section 
4, and the wave should output to the wave editor 1 2 to the output 
section 6. 

Drawing 5 ' s a f° w chart which shows the processing process of the 
judgment section 7. first — the case where the speech information 
judge that is the object which does not perform recognition processing 
of as opposed to this only by outputting the speech information (wave) 
inputted when the inputted speech information judged whether it was 
under [ sound-recording ] ****** in the phonetic-memory section 4 
(step S1 1) and was recording (step S12) and which was inputted on the 
other hand is not being recorded — an input device in use — a 
keyboard — or a mouse judges (step S13). 
[0040] 

When the inputted speech information is judged to be command input in 
the case of a keyboard (step S14) and a mouse is under activity, it 
judges whether there is any demand of an alphabetic character input 
(step S15), when there is nothing, the inputted speech information is 
judged to be command input (step S14), and in being, the inputted 
speech information judges with text input (step S16). 
[0041] 

An operator inputs his voice through a microphone concretely (sound 
recording), the wave is edited, and the case where it is under activity 
that an edit result should be saved at a file is explained. 
An operator utters toward a microphone and starts the sound recording 
of his voice. Under such a condition, the information on the purport 
which voice is recording from the wave editor 12 to the judgment 
section 7 is inputted. By this, to the speech information inputted in the 
speech processing section 5, the judgment section 7 does not need to 
perform audio recognition processing, and judges only with it being that 
the wave may only be outputted. 

The judgment section 7 is directed that the voice wave which ordered 
that the speech information inputted to the phonetic memory section 4 
based on this judgment should be recorded, and was inputted to the 
output section 6 should be outputted to the wave editor 1 2 as it is. In 
addition, since speech recognition processing is not needed to the 
dictionary change section 8, a command is not outputted. 
[0042] 

Consequently, after the speech information digitized in the A/D- 
conversion section 3 is recorded in the phonetic memory section 4, it is 
outputted to the direct-output section 6, and the output section 6 
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" outputs the inputted voice wave to the wave editor 12. 
An operator starts wave-like edit, after the sound recording of utterance 
is completed. The wave editor 1 2 processes that the wave specified 
supposing it performed echo processing to the wave which the operator 
specified the wave-like section using the mouse, and was specified that 
it utters with an "echo" and uttered with "the clearance" should be 
eliminated. 
[0043] 

That is, when a current keyboard is using it, the judgment section 7 
judges with the speech information inputted based on the signal inputted 
through the signal and the wave editor 1 2 in which it is shown that it is 
under activity from a keyboard being a command. 
It orders in order, as for the judgment section 7, to send voice to the 
speech processing section 5 to the phonetic memory section 4 
according to this, and it orders that the dictionary 10 for commands 
should be chosen to the dictionary change section 8, and it orders in 
order to send a command to the wave editor 12 to the output section 6. 
[0044] 

Consequently, the speech processing section 5 performs recognition 
processing to the speech information inputted using the dictionary 1 0 
for commands, recognizes the "echo" or "a clearance" which is input 
voice, and outputs this to the output section 6. 

The output section 6 makes a command the "echo" or "a clearance" 
which it is as a result of recognition, and delivery and this command are 
executed to the wave editor 1 2. 

Next, suppose that it uttered with "save" that the content which the 
operator edited should be saved. If this "save" name-unestablished- 
law-files and puts in another way, when it will be a new file, the wave 
editor 1 2 requires the name of a file. Then, when it utters with "its 
voice" as a file name, even if a mouse is under activity, since the wave 
editor 12 requires text input, it judges the speech information into which 
the judgment section 7 was inputted to be a text. 
[0045] 

The judgment section 7 is directed that a command should be outputted 
that the dictionary 9 for texts should be chosen to the dictionary change 
section 8, and "its voice" as a text should be outputted to the output 
section 6. Consequently, the speech processing section 5 performs 
recognition processing to speech information using the dictionary 9 for 
texts, and outputs this to the output section 6. The output section 6 is 
outputted to the wave editor 12 by using as a text "one's voice" which 
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it is as a result of recognition, and "its voice" which is a file name is 
inputted into the wave editor 1 2, and it is saved. 

If it is in such an example 2, even if the alphabetic character input like a 
file name or the numerical input of the delay time like an "echo" does 
not move a hand from the mouse under actuation to a keyboard, the 
input of it is attained by voice input. 
[0046] 

In addition, no matter the judgment result of the judgment section 7 may 
be what thing, when the result of speech recognition is "the specific 
word" defined beforehand, the output section 6 is good about which [ of 
examples 1 and 2 ] case, also as performing the predetermined output 
control defined beforehand. 

For example, when a specific word is "resizing" which is the operator 
command to a window manager and OS, or a predetermined name of a 
person, for example, "Tanaka", if there is the output section 6 in 
"resizing", it outputs the operator command for size change of a window 
to a window manager and OS. 
[0047] 

when the speech information into which application was under actuation 
under the predetermined window, and the judgment section 7 was 
inputted is judged to be a command to application in "resizing", it 
becomes equivalent to having set up the priority of the content of 
processing over input voice substantially using a recognition result, and 
operability is markedly alike and improves. 

Moreover, the misconception depended also when there is a possibility 
that the voice of "Tanaka" may mix frequently from a perimeter by 
setting up the content of processing of the judgment section so that 
[ "so that this may be disregarded" ], when "Tanaka" who is the name 
of a person currently used widely is inputted from the voice input 
section 1 can be avoided. 
[0048] 
(Example 3) 

Although the content of judgment processing of what kind of judgment 
to make perform based on input unit 2a, the operating condition of 2b, 
and activity hysteresis to the judgment section 7 in the examples 1 and 
2 explained the case where it was initialized on the occasion of start up 
of a voice input system, it can be reset in this example 3 at the event of 
arbitration. 
[0049] 

The content [ in / in drawing 6 (a) / the judgment section 7 ] of 
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judgment processing, i.e., the initialization processing process of an 
algorithm, and drawing 6 (b) are the flow charts of each setting-out 
modification processing process of an algorithm. 

first, a ****** [ that, as for initial setting of an algorithm, an initialization 
file exists at the time of starting of an audio input unit ] — judging — 
(step S21) the case where it does not exist — the time of "setting out 
of a proper", for example, a keyboard activity, — command input — 
moreover, judgment processing made into text input is set up at the 
time of a mouse activity (step S22) . 
[0050] 

Moreover, when the configuration file which a user will set up according 
to liking if it puts in another way when it exists exists, priority is given 
over "setting out of a proper" in the first half, and the judgment section 
7 reads an initialization file, (step S23) It sets up according to the 
content of this initialization file, (step S24) The current content of 
setting out is saved at a configuration file, (step S25) . 
[0051] 

On the other hand, in resetting, a user changes a configuration file using 
an editor etc. (step S31), and a new configuration file is reset according 
to read in (step S32) and this read content of a configuration file (step 
S33). 
[0052] 

Next, an example is given and explained. 

Suppose that the content of the now, for example, initialization, file was 
a thing like a table 1 . 
[0053] 
[A table 1] 
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[0054] 

The speech information inputted if the judgment section 7 which read 
such an initialization file judged the speech information inputted if it was 
at the time of a keyboard activity in the default case to be command 
input and it was at the time of a mouse activity will be disregarded. 
[0055] 

Moreover, if the window of a text editor is active when the operator is 
using the text editor, at the time of a keyboard activity, the speech 
information into which the inputted speech information was inputted with 
command input again at the time of a mouse activity will be judged to be 
text input. 

On the other hand, under such an initialization file, when an operator 
tries to use a wave editor while using the audio input unit, since setting 
out of a voice data point cannot be performed in this initialization file, a 
configuration file is reset. 

Now, suppose that the file of resetting was as shown in a table 2. 

[0056] 

[A table 2] 



2 







V V X j 












-r + X YATl 
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•r + X hA^J 


x + X h Ail 



[0057] 
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" By this, the speech information inputted in the speech information into 
which the judgment section 7 was inputted when it was [ keyboard ] 
under activity when a mouse was under activity with command input 
again is judged under the situation of using the wave editor now, 
respectively to be a wave input. 

Even if it is under the situation of merely using the wave editor, and it is 
using any of a keyboard and a mouse at the time of a file name input, 
the judgment section 7 will judge the inputted speech information to be 
text input. 
[0058] 

The judgment section 7 carries out judgment processing automatically 
according to this by setting up the algorithm of a judgment that the 
speech information inputted if it was during the mouse activity should be 
judged to be a "text" by making into a "command" speech information 
inputted if it was in such an example 3 and was to the judgment section 
7 while using a keyboard. By this, the handling to the voice inputted as 
actuation of an operator becomes cooperative, and working efficiency 
improves. 
[0059] 

[Effect of the Invention] 

If it is in the 1st invention, it is judging what kind of processing being 
made to perform to the speech processing section based on whether 
the input unit into which the judgment section inputs information other 
than voice is using it, based on this judgment result, processing of the 
speech processing section will be changed automatically, and an 
operator becomes possible [ carrying out automatic processing only by 
utterance ], without performing special actuation. 
[0060] 

If it is in the 2nd invention, it is judging what kind of processing being 
made to perform to the speech processing section based on the content 
of whether the input unit into which the judgment section inputs 
information other than voice is using it, and the configuration file, based 
on this judgment result, processing of the speech processing section will 
be changed automatically, and an operator becomes possible [ carrying 
out automatic processing only by utterance ], without performing special 
actuation. 
[0061] 

If it is in the 3rd invention, it is judging what kind of processing being 
made to perform to the speech processing section based on the content 
of whether the input unit into which the judgment section inputs 
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information other than voice is using it, and the configuration file, based 
on this judgment result, processing of the speech processing section will 
be changed automatically, and an operator becomes possible [ carrying 
out automatic processing only by utterance ], without performing special 
actuation. 
[0062] 

If it is in the 4th invention, in performing speech recognition in the 
speech processing section, various kinds of dictionaries are selected 
automatically, without an operator performing special directions, and it 
makes possible read-out to the speech processing section. 
[0063] 

It is judging what kind of processing being made to perform to the 
speech processing section based on whether if it is in the 5th invention, 
the input unit into which the judgment section inputs information other 
than voice is using it, based on this judgment result, processing of the 
speech processing section will be changed automatically, and an 
operator becomes possible [ carrying out automatic processing only by 
utterance ], without performing special actuation. 
[0064] 

if it is in the 6th invention, when the specific voice beforehand defined 
regardless of the judgment result of the judgment section is inputted, 
incorrect recognition is reduced by making the ranking of the highest 
priority defined beforehand process — operability both improves. 
[0065] 

If it is in the 7th invention, it becomes it can be large to a keyboard and 
a general purpose computer equipped with a mouse, and applicable to 
them. 

[Brief Description of the Drawings] 

[ Dra wing 1] It is principle drawing of this invention. 

[Drawing 2] It is the block diagram showing the configuration of the 

example 1 of this invention. 

[Drawing 3] It is the flow chart which shows the processing process of 
the judgment section in an example 1. 

[Drawing 4] It is the block diagram showing the configuration of an 
example 2. 

[Drawing 5] It is the flow chart which shows the processing process of 
the judgment section in an example 2. 

[Drawing 6] It is the flow chart which shows the initialization process 
and setting-out modification process of the content of judgment 
processing of the judgment section in an example 3. 
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" [ Drawin g 7] It is the block diagram showing the configuration of 
equipment conventionally. 
[Description of Notations] 

I Voice Input Section 
2a-2n Input unit 

5 Speech Processing Section 

6 Output Section 

7 Judgment Section 

8 Dictionary Change Section 

9 Dictionary for Texts 

10 Dictionary for Commands 

I I Word Processor 
1 2 Wave Editor 



[Translation done.] 
* NOTICES * 

JPO and INPIT are not responsible for any 
damages caused by the use of this translation. 

1. This document has been translated by computer. So the translation 
may not reflect the original precisely. 

2. **** shows the word which can not be translated. 
3.1n the drawings, any words are not translated. 



DESCRIPTION OF DRAWINGS 

[Brief Description of the Drawings] 

[D rawin g 1] It is principle drawing of this invention. 

[Drawin g 2] It is the block diagram showing the configuration of the 

example 1 of this invention. 

[Dra win g 3] It is the flow chart which shows the processing process of 
the judgment section in an example 1. 

[D ra wing 4] It is the block diagram showing the configuration of an 
example 2. 

[Drawing 5] It is the flow chart which shows the processing process of 
the judgment section in an example 2. 

[Drawin g 6] It is the flow chart which shows the initialization process 
and setting-out modification process of the content of judgment 
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processing of the judgment section in an example 3. 
[Drawing 7] It is the block diagram showing the configuration of 
equipment conventionally. 
[Description of Notations] 

I Voice Input Section 
2a-2n Input unit 

5 Speech Processing Section 

6 Output Section 

7 Judgment Section 

8 Dictionary Change Section 

9 Dictionary for Texts 

10 Dictionary for Commands 

I I Word Processor 
1 2 Wave Editor 



[Translation done.] 
* NOTICES * 

JPO and INPIT are not responsible for any 
damages caused by the use of this translation. 

1 This document has been translated by computer. So the translation 

may not reflect the original precisely. 

2.**** shows the word which can not be translated. 

3.1n the drawings, any words are not translated. 



DRAWINGS 
[Drawing 1] 
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[Draw ing 2] 
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[Draw in g 3] 
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[Drawing 4] 
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[Drawing 5] 
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[ Drawing 6] 
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[Drawin g 7] 



file://C:¥Documents%20and%20Settings¥saeko¥My%20Documents¥JPOEn¥JP-B-... 2007/02/15 



2a~ ] A^ggl - 
2* H Aflggl 

2 H a^b") - 



22 
f 



21 



[Translation done.] 
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